Search CORE

12 research outputs found

Tight Integrated End-to-End Training for Cascaded Speech Translation

Author: Bahar Parnia
Bieschke Tobias
Ney Hermann
Schlüter Ralf
Publication venue
Publication date: 01/01/2020
Field of study

A cascaded speech translation model relies on discrete and non-differentiable transcription, which provides a supervision signal from the source side and helps the transformation between source speech and target text. Such modeling suffers from error propagation between ASR and MT models. Direct speech translation is an alternative method to avoid error propagation; however, its performance is often behind the cascade system. To use an intermediate representation and preserve the end-to-end trainability, previous studies have proposed using two-stage models by passing the hidden vectors of the recognizer into the decoder of the MT model and ignoring the MT encoder. This work explores the feasibility of collapsing the entire cascade components into a single end-to-end trainable model by optimizing all parameters of ASR and MT models jointly without ignoring any learned parameters. It is a tightly integrated method that passes renormalized source word posterior distributions as a soft decision instead of one-hot vectors and enables backpropagation. Therefore, it provides both transcriptions and translations and achieves strong consistency between them. Our experiments on four tasks with different data scenarios show that the model outperforms cascade models up to 1.8% in BLEU and 2.0% in TER and is superior compared to direct models.Comment: 8 pages, accepted at SLT202

arXiv.org e-Print Archive

Publikationsserver der RWTH Aachen University

Anle138b: a novel oligomer modulator for disease-modifying therapy of neurodegenerative diseases such as prion and Parkinson’s disease

Author: A Gallardo-Godoy
A Giese
A Giese
A Pfeifer
Andreas A. Deeg
Andrei Leonov
Armin Giese
B Caughey
BS Gadad
C Korth
C Soto
C Wasmer
CA Lipinski
Catharina Prix
CG Glabe
Christian Griesinger
CR Birkett
CR Trevitt
DE Clark
DF Veber
DJ Selkoe
DP Karpinar
E Angot
E Masliah
E Tolosa
F Fornai
F Leidel
F Pan-Montojo
F Pan-Montojo
F Schmidt
Fabienne Leidel
Felix Schmidt
Francisco Pan-Montojo
G Bitan
GB Irvine
Gerda Mitteregger-Kretzschmar
GP Saborio
GR Lamberto
GR Mallucci
HA Lashuel
Hans Kretzschmar
Henning Urlaub
J Bieschke
J Bieschke
J Bieschke
J Castilla
J Collinge
J Collinge
J Safar
J Simon-Sanchez
JD Harper
JD Wadsworth
Jens Pilger
Jens Wagner
Jochen H. Weishaupt
Johannes Levin
JR Cannon
JR Streffer
Julian J. Krauth
Kai Bötzel
KC Luk
L Colombo
M Geissen
M Kostka
M Neumann
Manfred Uhr
Markus Geissen
Markus Zweckstetter
Martin Eiden
Martin Groschup
Mathias Bähr
Matthias Samwer
MH Groschup
MJ Thompson
MS Forman
MS Forman
P Brown
P Desplats
P Mueller
P Weber
Paul Tavan
R Demaimay
S Connelly
S Ghaemmaghami
S Hüls
S Tzaban
SB Prusiner
SB Prusiner
Sergey Ryazanov
Song Shi
T Högen
Thomas Hirschberger
Tobias Frank
U Bertsch
Ulrike Teichmann
Uwe Bertsch
V Gayrard
Wolfgang Zinth
Y Kawasaki
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A comparative study on end-to-end speech to text translation

Author: Bahar Parnia
Bieschke Tobias
Ney Hermann
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study